Chun Yuan

Generation Enhances Understanding in Unified Multimodal Models via Multi-Representation Generation

Jan 29, 2026
Via arXiv

What Makes Low-Bit Quantization-Aware Training Work for Reasoning LLMs? A Systematic Study

Jan 21, 2026
Via arXiv

Learning to Pose Problems: Reasoning-Driven and Solver-Adaptive Data Synthesis for Large Reasoning Models

Nov 13, 2025
Via arXiv

Towards Implicit Aggregation: Robust Image Representation for Place Recognition in the Transformer Era

Nov 08, 2025
Via arXiv

Sparse Model Inversion: Efficient Inversion of Vision Transformers for Data-Free Applications

Oct 31, 2025
Via arXiv

FlashVSR: Towards Real-Time Diffusion-Based Streaming Video Super-Resolution

Oct 14, 2025
Via arXiv

Unified and Semantically Grounded Domain Adaptation for Medical Image Segmentation

Aug 12, 2025
Via arXiv

Text-guided Visual Prompt DINO for Generic Segmentation

Aug 08, 2025
Via arXiv

UniGlyph: Unified Segmentation-Conditioned Diffusion for Precise Visual Text Synthesis

Jul 02, 2025
Via arXiv

A Simple Linear Patch Revives Layer-Pruned Large Language Models

May 30, 2025
Via arXiv